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EMBEDDED QUADTREE WAVELETS IN IMAGE COMPRESSION 



Field of the Invention 

The present invention relates generally to image coding, and more particularly to 
compression and decompression of digital images. 

Background of the Invention 

The advent of multimedia computing has created an increased demand for high- 
performance image compression systems. In the last few years, the wavelet transform has 
become a mainstream, base technology for image compression coding. Wavelet 
transforms, otherwise known as hierarchical subband decompositions, result in multi- 
resolution decomposition hierarchy (MDH) representations of the source image as 
illustrated in Fig. 1. Bit rates lower than 1 bit/pixel can be achieved through the efficient 
coding of the wavelet transform coefficients generated in the production of the MDH 
data. 

A most important and beneficial characteristic of the wavelet coefficients generated by 
the transform is that most of the coefficients will possess very small amplitudes that will 
reduce to zeros after scalar quantization. For many image processing purposes, the 
importance or significance of a wavelet transform coefficient can be measured by its 
absolute value in relation to predetermined threshold values. A wavelet coefficient is said 
to be significant or insignificant, in relation to a particular threshold value, depending on 
whether or not its magnitude exceeds that threshold. The importance of a set of wavelet 
coefficients can be collectively ascertained using a "significance map". A "significance 
map" is a bitmap recording the location of the significant coefficients. A large fraction of 
the bit budget may be spent on encoding the significance map. Therefore, the 
compression performance of an image coding system largely relies on its efficiency in 
coding the significance map. 
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other words, the zerotree representation of objects inherently prevents independent 
decodability. This inseparability also introduces a higher susceptibility to bit errors. A 
single bit error could potentially/after interpretation at each succeeding resolution level, 
lead to decoder derailment. Finally, the closed structure of zerotree representation makes 
it difficult to add in new coding methods or features. 

Summary of the Invention 

The present invention is a method of compressing grayscale and color image data with a 
high degree of compression performance. An objective of the present invention is to 
provide a compressibly efficient, fast, method and system to code the significance 
information of the wavelet transform coefficients. A further objective is to provide a 
method and system of producing a compressed bit-stream that is scalable, region-based 
accessible, robust to errors, and independently decodable. The present invention provides 
a logically simple and fast method of coding that possess and a high degree of parallelism 
that lends itself to hardware implementation. The bit-stream produced by the present 
system is more robust to bit error than the prior art since all sub-band blocks are encoded 
independently and errors at one scale will not lead to errors in other scales. 

The invented coding process comprises the steps of forming a wavelet transform of the 
image, setting an initial threshold and initial list of insignificant blocks, forming the list 
of significant coefficients by encoding of significant map based on its quadtree 
representation, reducing the threshold, sending of refinement bits of those significant 
coefficients. The invention also includes a method and system to decode image files that 
have been encoded using the present invention. 

Brief Description of the Drawings 

Figure 1 is a schematic illustration of a three-layer wavelet decomposition. 
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This procedure is performed recursively to produce a multi-resolution decomposition 
hierarchy (MDH) of the original image. This is illustrated in Fig.l where three levels of 
transformation have been applied. Of course, the skilled reader will appreciate that an 
arbitrary number of sub-band decompositions may be applied. 

In Fig. 1 the lowest frequency sub-band i.e the sub-band that provides the coarsest 
resolution scale, is that at the top, left-most block 101 represented by LL3. The highest 
frequency sub-bands or those at the finest resolution scale are the blocks HL1 102, LH1 
103,andHHl 104. 

Figure la is a graphic illustration of the present invention's three layer wavelet 
decomposition of the test image Lena. The original image laOl can be seen to have 3 
levels of resolution in the decomposed image la02. The high frequency data of HH1 104 
can be seen to offer the most detail in the bottom, right-most block la03. 



After a wavelet transform has occurred, each pixel is represented by a wavelet transform 
coefficient. In the preferred embodiment of the current invention, each of these 
coefficients is represented in a fixed-point, binary format, most typically with less than 16 
bits, and treated as an integer. Fig. 2 illustrates the binary representation in the general 

20 case of a wavelet transform coefficient. In this system, the first bit 201 is dedicated to 

represent its sign -- positive or negative. The first non-zero bit 202 following the sign bit 
is called the leading one bit or LOB. The position of the LOB is determined by the 
magnitude of the coefficient. That is to say that the larger the value of the coefficient, the 
more closely after the sign bit will it occur. All of the bits following the LOB 202 are 

25 called refinement bits 203. 

After the coefficients are generated in the wavelet transformation and are given their 
binary representation, three lists are initialized. The first of these is called the list of 
significant pixels or LSP. Each entry in LSP corresponds to an individual pixel on the 
30 MDH plane and is identified by a pair of coordinates (i j). The LSP is initialized as an 
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that the number of significant coefficients is one, and the size of the block is one, this 
entry is a single coefficient and its coordinates are moved to LSP. 

There are two methods available to process the sub-blocks. The first method, known as 
5 depth-first quadtree coding, inserts the four sub-blocks into LIB immediately following 
the position of their parent block. The four child blocks are then evaluated immediately 
with respect to their significance and this operation is applied recursively until no more 
subdivision is possible. When all significant coefficients in this block are found and 
moved into LSP, the coding of the present entry is completed. The process then moves to 
10 the next block in the LEB. 

The second method, or breadth-first quadtree coding, adds these four sub-blocks to the 
end of LIB where they are evaluated before the same pass ends. With the breadth-first 
process, all parent squares at the same level will be processed before any blocks of the 
next generation. 

After all entries in the present LIB have been processed at one level of significance, the 
entries in TLIB are reordered according to the size of the block: each block must be put 
before those blocks with larger size so that it can be processed first for the next threshold. 
20 Most pixels adjacent to significant pixels have been moved into TLIB as pixel level 
entries if not significant to the present threshold. Due to the correlation of adjacent 
coefficients, it is very likely that these adjacent pixels will be significant at the next 
threshold level. In the event of a strict bit budget, we must put these pixel level blocks 
first to ensure that precious bits are not used to find significant coefficients from big 
25 blocks, and risk missing pixel level significant coefficients. The reordering of TUB will 
therefore aid the encoding of more significant coefficients using fewer bits. While not 
essential, experiments show that higher PSNR will be achieved using this reordering 
scheme. The final step in this quadtree process is to replace the LIB with TLIB for 
subsequent scanning at the next level of significance and to reset TLIB to empty. Before 
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equal sized sub-blocks. If the encoder has used the depth-first method (this decision 
having been made by the encoder and which information is contained in the header part 
of the bitstream), insert the sub-blocks into LIB at its parent block position. If the encoder 
has used the breadth-first method, add the sub-blocks to the end of LIB. After all entries 
5 in LIB have been decoded, using TLB to replace LIB, which will be processed at the next 
threshold level. The LIB is reordered according to the same rule as in encoding, and the 
TLEB is reset as empty. 

In the refinement pass of the decoding, all coefficients, which have been moved into LSP, 
are updated according to following rule: if the coefficient is negative, then add 2 N " 1 if 
received bit is 0, or subtract 2 N ' K if received bit is 1. On the contrary, if the coefficient is 
positive, then add 2""' if received bit is 1, or subtract 2 A_I if received bit is 0. 

At any point in the encoding or decoding process of the present invention, bit 
consumption may be calculated to determine if the bit budget has been exceeded and the 
process may be halted. In this manner, precise bit rate control can be easily achieved if 
there is no arithmetic coding on the bit stream. With arithmetic coding, the resultant 
bitstream is usually shorter than the desired length. 
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Figure 6. 



figure 7. 
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Figure 9. 




Figure 10. 
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